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(ii) TITLE OF INVENTION: Modulators 

(TRAF), their Preparation "and 

(iii) NUMBER OF SEQUENCES: 11 



TNF Receptor Associated Factor 
Jse 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS ' 

(D) SOFTWARE: Patentln Release #1.0 A Version #1.30 (EPO) 



(v) 
(vi) 




(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: IL 119133 

(B) FILING DATE: 26-AUG-1996 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CATTGGGTCA CGCGGTGGCG GCGCTCTAGA ATAGTGGATC CCCCGGGCTG CAGGAATTCG 60 

ATTCGAGGCC ACGAAGGCCG GCGGCGCGGC GCANGCACCG GCCCGGGGAN AGGCNCCATG 120 

AGCGGATCNC NGAACNATGA CAAAAGACAA TTTCTGCTGG AGCGACTGCT GGATGCAGTG 180 

AAACAGTGCC AGATCCGCTT TNGAGGGAGA AAGGAGATTG CCTCGGATTC CGACAGCAGG 240 

GTCACCTGTC TGTGTGCCCA GTTTGAAGCC GTCCTGCAGC ATGGCTTGAA GAGGAGTCGA 300 

GGATTGGCAC TCACAGCGGC AGCGATCAAG CAGGCAGCGG GCTTTGCCAG CAAAACCGAA 3 60 

ACAGAGCCCG TGTTCTGGTA CTACGTGAAG GAGGTCCTCA ACAAGCACGA GCTGCAGCGC 420 

TTCTACTCCC TGCGCCACAT CGCCTCAGAC 'GTGGGCCGGG GTCGCGCCTG GCTGCGCTGT 4 80 

GCCCTCAACG.AACACTCCCT GGAGCGCTAC CTGCACATGC TCCTGGCCGA CCGCTGCAGG 54 0 

CTGAGCACTT TTTATGAAGA CTGGTCTTTT GTGATGGATG AAGAAAGGTC CAGTATGCTT 600 

CCTACCATGG CAGCAGGTCT GAACTCCATA CTCTTTGCGA TTAACATCGA CAACAAGGAT 660 

TTGAACGGGC AGAGTAAGTT TGCTCCCACC GTTTCAGACC TCTTAAAGGA GTCAACGCAG 720 

AACGTGACCT CCTTGCTGAA GGAGTCCACG CAAGGAGTGA GCAGCCTGTT CAGGGAGATC 78 0 

ACAGCCTCCT CTGCCGTCTC CATCCTCATC AAACCTGAAC AGGAGACCGA CCCTTGCCTG 84 0 

TCGTGTCCAG GAATGTCAGT GCTGATGCCA AATGCAAAAA GGAGCGGAAG AAGAAAAAGA 900 

AAGTGACCAA CATAATCTCA TTTGATGATG AGGAAGATGA GCAGAACTCT GGGGACGTGT 960 

T T AAAAAG AC ACCTGGGGCA GGGGAGAGCT CAGAGGACAA CTCCGACCGC TCCTCTGTCA 102 0 

ATATCATGTC CGCCTTTGAA AGCCCCTTCG GGCCTAACTC . CAATGGAATC AGAGCAGCAA 1080 

CTCATGGAAA ATTGATTCCC TGTCTTTGAA CGGGGAGTTT GGGTACCAGA AGCTTGATGT 114 0 

GAAAAGCATC GATGATGAAG ATGTGGATGA AAACGAAGAT GACGTGTATG GAAACTCATC 1200 



j^^GGAGTC GCCCGAGAAG CCACTGGAAG ^^^< 



AGGAAGGAAG CACAGGGGCC ^^TCGGAGTC GCCCGAGAAG CCACTGGAAG GGXACACCTG 12 60 

CCTCTCCCAG ATGCACAGCT GGGCTCCGCT GAAGGTGCTG CACAATGACT CCGACATCCT 1320 

CTTCCCTGTC AGTGGCGTGG GCTCCTACAG CCCAGCAGAT GCCCCCCTCG GAAGCCTGGA 1380 

GAACGGGACA GGACCAGAGG ACCACGTTCT CCCGGATCCT GGACTTCGGT ACAGTGTGGA 14 4 0 

AGCCAGCTCT CCAGGCCACG GAAGTCCTCT GAGCAGCCTG TTACTTCTGC CTCAGTGCCA 1500 

GAGTCCATGA CAATTAGTGA ACTGCGCCAG GCCACTGTGG CCATGATGAA CAGGAAGGAT 15 60 

GAGCTGGAGG AGGAGAACAG ATCACTGCGA AACCTGCTCG ACGGTGAGAT GGAGCACTCA 1620 

GCCGCGCTCC GGCAAGAGGT GGACACCTTG AAAAGGAAGG TGGCTGAACA GGAGGAGCGG 168 0 

CAGGGCATGA AGGTCCAGGC GCTGGCCAGC TATCTTTGCT ATTTTGTGAG GAGATTCTAA 17 4 0 

CCCCACGTGA GAACCATGTG GTGGAGAAAT GGAGGGAGAG AGAAATCCAA CAGTTCCTGA 18 00 

TAGTCTCATT TGAGCTCCTG GATCCAGTCT TTCCTGAAGC TGTGTTTCCT CTGGACTTTT 18 60 

CATGTATGTG AGCCAATAAA TTGCTTTCAT TCCTTGAAAA AAAAAA 190 6 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Thr Gly Pro Gly Xaa Gly Xaa Met Ser Gly Ser Xaa Asn Xaa Asp 

15 10 15 

Lys Arg Gin Phe Leu Leu Glu Arg Leu Leu Asp Ala Val Lys Gin Cys 
20 25 30 

Gin He Arg Phe Xaa Gly Arg Lys Glu He Ala Ser Asp Ser Asp Ser 
35 40 45 

Arg Val Thr Cys Leu Cys Ala Gin Phe Glu Ala Val Leu Gin His Gly 
50 55 60 

Leu Lys Arg Ser Arg Gly Leu Ala Leu Thr Ala Ala Ala He Lys Gin 
65 70 75 80 

Ala Ala Gly Phe Ala Ser Lys Thr Glu Thr Glu Pro Val Phe Trp Tyr 

85 90 95 

Tyr Val Lys Glu Val Leu Asn Lys His Glu Leu Gin Arg Phe Tyr Ser 
100 105 HO 

Leu Arg His He Ala Ser Asp Val Gly Arg Gly Arg Ala Trp Leu Arg 



115 



120 



125 



Cys Ala Leu Asn Glu His Ser Leu Glu Arg Tyr Leu His Met Leu Leu 
130 135 140 

Ala Asp Arg Cys Arg Leu Ser Thr Phe Tyr Glu Asp Trp Ser Phe Val 
145 150 155 160 

Met Asp Glu Glu Arg Ser Ser Met Leu Pro Thr Met Ala Ala Gly Leu 
165 170 175 

Asn Ser He Leu Phe Ala He Asn He Asp Asn Lys Asp Leu Asn Gly 
180 185 190 

Gin Ser Lys Phe Ala Pro Thr Val Ser Asp Leu Leu Lys Glu Ser Thr 
195 200 205 

Gin Asn Val Thr Ser Leu Leu Lys Glu Ser Thr Gin Gly Val Ser Ser 
210 215 220 

Leu Phe Arg Glu He Thr Ala Ser Ser Ala Val Ser He Leu He Lys 
225 230 235. 240 

Pro Glu Gin Glu Thr Asp Pro Cys Leu Ser Cys Pro Gly Met Ser Val 
245 250 255 

Leu Met Pro Asn Ala Lys Arg Ser Gly Arg Arg Lys Arg Lys Xaa Pro 
260 265 270 

Thr Xaa Ser His Leu Met Met Arg Lys Met Ser Arg Thr Leu Gly Thr 
275 280 285 

Cys Leu Lys Arg His Leu Gly Gin Gly Arg Ala Gin Arg Thr Thr Pro 
290 295 300 

Thr Ala Pro Leu Ser He Ser Cys Pro Pro Leu Lys Ala Pro Ser Gly 
305 310 315 320 

Leu Thr Pro Met Glu Ser Glu Gin Gin Leu Met Glu Asn Xaa Phe Pro 
325 330 335 

Val Phe Glu Arg Gly Val Trp Val Pro Glu Ala Xaa Cys Glu Lys His 
340 345 350 

Arg Xaa Xaa Arg Cys Gly Xaa Lys Arg Arg Xaa Arg Val Trp Lys Leu 
355 360 365 

lie Arg Lys Glu Ala Gin Gly Pro Leu Gly Val Ala Arg Glu Ala Thr 
370 375 380 

Gly Arg Glu His Leu Pro Leu Pro Asp Ala Gin Leu Gly Ser Ala Glu 
385 390 395 * 400 

Gly Ala Ala Gin Xaa Leu Arg His Pro Leu Pro Cys Gin Trp Arg Gly 



405 



410 



415 



Leu Leu Gin Pro Ser Arg Cys Pro Pro Arg Lys Pro Gly Glu Arg Asp 
420 425 430 

Arg Thr Arg Gly Pro Arg Ser Pro Gly Ser Trp Thr Ser Val Gin Cys 
435 440 445 



Gly Ser Gin Leu Ser Arg Pro Arg Lys Ser Ser Glu Gin Pro Val Thr 
450 455 460 

Ser Ala Ser Val Pro Glu Ser Met Thr He Ser Glu Leu Arg Gin Ala 
465 470 475 480 

Thr Val Ala Met Met Asn Arg Lys Asp Glu Leu Glu Glu Glu Asn Arg 
485 490 495 

Ser Leu Arg Asn Leu Leu Asp Gly Glu Met Glu His Ser Ala Ala Leu 
500 505 510 

Arg Sin Glu Val Asp Thr Leu Lys Arg Lys Val Ala Glu Gin Glu Glu 
515 520 525 

Arg Gin Gly Met Lys Val Gin Ala Leu Ala Ser Tyr Leu Cys Tyr Phe 
530 535 540 

Val Arg Arg Phe Xaa Pro His Val Arg Thr Met Trp Trp Arg Asn Gly 
545 550 555 560 

Gly Arg Glu Lys Ser Asn Ser Ser Xaa Xaa Ser His Leu Ser Ser Trp 
565 570 575 

He Gin Ser Phe Leu Lys Leu Cys Phe Leu Trp Thr Phe His Val Cys 
580 585 590 

Glu Pro He Asn Cys Phe His Ser Leu Lys Lys Lys 
595 600 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCCCTCTCAC 


AGCCCAGGCC 


ATCCAAGAGG 


GGCTGAGGAA 


AGAGCCCATC 


CACCGCGTGT 


60 


CTGCAGCGGA 


GCTGGGAGGG 


AAGGTGAACC 


GGGCACTACA 


GCAAGTGGGA 


GGTCTGAAGA 


120 


GCCCTTGGAG 


GGGAGAATAT 


AAAGAACCAA 


GACATCCACC 


GCCAAATCAA 


GCCAATTACC 


180 


ACCAGACCCT 


CCATGCCCAG 


CCGAGAGAGC 


TTTCGCCAAG 


GGCCCCAGGG 


CCCCGGCCAG 


240 


CTGAGGAGAC 


AACAGGCAGA 


GCCCCTAAGC 


TCCAGCCTCC 


TCTCCCACCA 


GAGCCCCCAG 


300 


AGCCAAACAA 


GTCTCCTCCC 


TTGACTTTGA 


GCAAGGAGGA 


GTCTGGGATG 


TGGGAACCCT 


360 


TACCTCTGTC 


CTCCCTGGAG 


CCAGCCCCTG 


CCAGAAACCC 


CAGCTCACCA 


GAGCGGAAAG 


420 


CAACCGTCCC 


GGAGCAGGAA 


CTGCAGCAGC 


TGGAAATAGA 


ATTATTCCTC 


AACAGCCTGT 


480 



CCCAGCCATT TTCTCTGGAG 
TCTCCCTGTC GGATGACAGT 
CCCTGAGCTC AGGCGTACAC 
ACATGGTGCT GGCCCGGGGG 
TCCAAATACA GTCTCTTAAT 
TGGGAGACAT CGCCACTGGC 
CCAAAGACGG GCAGCCTGTT 
AGTGCACACT GGCCCCTGAT 
TGGAGAACAG GCCCTAACCC 
TCCTGCTCGG TGCACGATGC 
NCCAGCCCCC CGGCTCARCA 
CAAGCNAGAA TGCCTCCCAG 
AAACAYTNCC GCCACGTGAA 
ACAAAACAGG GATCTTTNTT 
ANTCAGTGAC CATTTGTTGG 
GTGGGCGAGC CCTTCGGCCC 
AGGGNCCCAA ANCTCAGGNT 
GGTTAANNGG GGGCCCTCTN 
CTTTTGGGTG TAGGGGAAAA 
CACACTTTTC AGGTTGTTGC 
AAAGAAGGTG TGTAAGTGAA 
GCTACCACTC TTCCCCAGAG 
ACTCGCTGGC ACTCAGTTCC 
CAGGAACAGT CTGTGGATGG 
TCCGGCGCCC CAGNCCCCAC 
GCTNCAGNCA TCANCACTGA 
CCGNACGGCA CTTTGCACNT 
GCAGGGNCAG GGNCAGGGNC 
GGTGNAAGGG ANCACAGTCT 
NCCAGNATTT CTCTAAGAAT 




GAGCAGGAGC AAATTCTCTC GTGCCTCAGC ATCGACAGCC 54 0 

GAGAAGAACC CATCAAAGGC CTCTCAAAGC TCGCGGGACA 600 

TCCTGGAGCA GCCAGGCCGA GGCTCGAAGC TCCAGCTGGA 6 60 

CGGCCCACCG ACACCCCAAG CTATTTCAAT GGTGTGAAAG 7 20 

GGTGAACACC TGCACATCCG GGAGTTCCAC CGGGTCAAAG 780 

ATCAGCAGCC AGATCCCAGC TGCAGCCTTC AGCTTGGTCA 84 0 

CGCTACGACA TGGAGGTGCC AGACTCGGGC ATCGACCTGC 900 

GGCAGCTTCG CCTGGAGCTG GAGGGTCAAG CATGGCCAGC 960 

TGCCCTCCAC CGCCGGCTCC ACACTGCCGG AAAGCAGCCT 1020 

TGCCCTGAAA ACACAGGCTC AGCCGTTCCC AGGGGATYTG 108 0 

GNTGGGAACC AGGGCCTCGN CAGCNAGCNA AGGTNGGGGG 114 0 

GATTTCACAN CCTGAGCCCN TGCCCCANCC CTGCTGAADA 12 00 

GAGACAGAAG GAGGATGGNC AGGAGTTNNA CCTYGGGGAA 12 60 

CTGCCCCTGC TCCAGTNCGA GTTGGCCTGN ACCCGCTTGG 1320 

CAGANCAGGG GAGAGCAGCT TCCAGCCTGG GTCAGAAGGG 1380 

CTCACCCTNC CAGGCTGCTG TGNAGAGTGT CAAGTGTGTA 14 4 0 

TCAGTGCAGA ACCAGGTNCA GCAGGTATGC CCGCCCGNTA 1500 

AAACCCCTTG CCTNGGCCTN CACCTNGGCC AGCTCANCCC 15 60 

GAATGCCTGA CCCTGGGAAG GCTWCCCTGG TAGAATACAC 1620 

AACACAGGTC CTGAGTTGAC CTCTGGTTCA GCCAAGGACC 168 0 

GTGGTTCTCA GTNCCCCAGA CATGTGCCCC TTTGCTGCTG 17 4 0 

CAGCAGGCCC CGAGCCCCTT CAGGCCCAGC ACTGCCCCAG 1800 

CTCATCTGTA AAGGTGAAGG GTGATGCAGG ATATGCCTGA 18 60 

AC AT GAT C AG TGCTNAAGGN AAAGCAGCAG AG AG AG AC GY 1920 

TNATCAGTGT NCCAGCGTGC TNGGTTNCCC CAGNAGCACA 198 0 

CACTNCACCC TNGCCCTGCC CCTNGGCCAN GAGGGTACTG 204 0 

CTGATGNACC TCAAAGCACT TTCATGGCTN GCCCTCTNNG 2100 

AGTGACANCT GTAGGNAGCA TANGCAANGC CAGGAGATGG 2160 

TGAGCTGTCC ANCATGCATG TGACTNCCTC AAACCTCTTN 222 0 

AGCANCCCCC TTNCCCCATT GCCCCAGCTT AGCCTCTTCT 2 280 




CCCAGGGGAG CTANCTCAGG OT^ACGTAG CATTAAATCA GCTGTGNAAT CGTCAGGGGG 2 34 0 

TGTCTGCTAG CCTCAACCTC CTGGGGCAGG GGACGCCGAG ACTCCGTGGG AGAAGCTCAT 2 4 00 

TCCCACATCT TGCCAAGACA GCCTTTNGTC CAGCTGTCCA CATTGAGTCA GACTGCTCCC 2 4 60 

GGGGAGAGAG CCCCGGCCCC CAGCACATAA AGAACTGCAG CCTTGGTACT GCAGAGTCTG 2 520 

GGTTGTAGAG AACTCTTTGT AAGCAATAAA GTTTGGGGTG ATGACAAATG TTAAAAAAAG 2 580 

GCCTTCGTGG CCTCGAATCA AGCTTATCGA TACCGTCGAC CTCGAGGGGG G 2 631 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CATTGGAGTC ACGCGGTGGC GGCGCTCTAG AATAGTGGAT CCCCGGGCTG CANGGAATTC 60 

GATTCGAGCC CACGAAGGCC CCTTCTTCTG TGGTCGCGGC ACGTTTACAG CCGCAAGCAC 120 

CCAGCGGCAG CTGAAGGAGG CTTTTGAGAG GCTCCTGCCC CAGGTGGAGG CGGCCCGCAA 180 

GGCCATCCGC GCCGCTCAGG TGGAGCGCTA TGTGCCCGAA CACGAGCGAT GCTGCTGGTG 24 0 

CCTGTGCTGC GGCTGTG/aGG TGCGGGAACA CCTGAGCCAT GGAAACCTGA CGGTGCTGTA 300 

CGGGGGGCTG CTGGAGCATC TGGCCAGCCC AGAGCACAAG AAAGCAACCA ACAAATTCTG 360 

GTGGGAGAAC AAAGCTGAGG TCCAGATGAA AGAGAAGTTT CTGGTCACTC CCCAGGATTA 420 

TGCGCGATTC AAGAAATCCA TGGTGAAAGG TTTGGATTCC TATGAAGAAA AGGAGGATAA 4 80 

AGTGATCAAG GAGATGGCAG CTCAGATCCG TGAGGTGGAG C AG AGCCG AC AGGAGGTGGT 54 0 

TCGGTCTGTC TTAGAGCCTC AGGCAGTGCC AGACCCAGAA GAGGGCTCTT CAGCACCTAG 600 

AAGCTGGAAA GGGATGAACA GCCAAGTAGC TTCCAGCTTA CAGCAGCCCT CAAATTTGGA 660 

CCTGCCACCA GCTCCAGAGC TTGACTGGAT GGAGACAGGA CCATCTCTGA CATTCATTGG 720 

CCATCAGGAT ATACCAGGAG TTGGTAACAT CCACTCAGGT GCCACACCTC CCTGGATGAT 7 80 

CCAAGATGAA GAATACATTG CTGGGAACCA AGAAATAGGA CCATCCTATG AAGAATTTCT 840 

TAAAGAAAAG GAAAAACAGA AGTTGAAAAA ACTCCCCCCA GACCGAGTTG GGGCCAACTT 900 

TGATCACAGC TCCAGGACCA GTGCAGGCTG GCTGCCCTCT TTTGGGCCGC GTCTGGAATA 960 

ATGGACGCCG CTGGCAGTCC AGACATCAAC TCCAAAACTG AAGCTGCAGC AATGAAGAAG 1020 



CAGTCACATA CAGAAAAAAG cStTCATGC TCTCTACCAA CTACCATGAG GClAAAAGCC 108 0 

AAAGTCAACC AAACCCCTAT TATACCTTCC ACCCAAATTC TTTATCATTG TCTTTCTTAG 1140 

GAAACAGACA TACTCATTCA TTTGATTTAA TAAAGTTTTA TTTTTCGGCC TTCGTGGCCT 1200 

CGAATCAAGC TTATCGATAC CGTCGACCTC GAGGGGGGGC CGTACCCACT TTT 12 53 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 17 amino acids 
{B) TYPE: amino acid 
"(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

lie Gly Val Thr Arg Trp Arg Arg Ser Arg lie Val Asp Pro Arg Ala 
15 10 15 

Ala Xaa Asn Ser He Arg Ala His Glu Gly Pro Phe -Phe Cys Gly Arg 
20 25 30 

Gly Thr Phe Thr Ala Ala Ser Thr Gin Arg Gin Leu Lys Glu Ala Phe 
35 40 45 

Glu Arg Leu Leu Pro Gin Val Glu Ala Ala Arg Lys Ala He Arg Ala 
50 55 60 

Ala Gin Val Glu Arg Tyr Val Pro Glu His Glu Arg Cys Cys Trp Cys 
65 70 75 80 

Leu Cys Cys Gly Cys Glu Val Arg Glu His Leu Ser His Gly Asn Leu 
- 85 90 95 

Thr Val Leu Tyr Gly Gly Leu Leu Glu His Leu Ala Ser Pro Glu His 
100 105 HO 

Lys Lys Ala Thr Asn Lys Phe Trp Trp Glu Asn Lys Ala Glu Val Gin 
115 120 125 

Met Lys Glu Lys Phe Leu Val Thr Pro Gin Asp Tyr Ala Arg Phe Lys 
130 135 140 

Lys Ser Met Val Lys Gly Leu Asp Ser Tyr Glu Glu Lys Glu Asp Lys 
145 150 155 160 

Val He Lys Glu Met Ala Ala Gin He Arg Glu Val Glu Gin Ser Arg 
165 170 175 

Gin Glu Val Val Arg Ser Val Leu Glu Pro Gin Ala Val Pro Asp Pro 
180 185 190 

Glu Glu Gly Ser Ser Ala Pro Arg Ser Trp Lys Gly Met Asn Ser Gin 
195 200 205 



Val Ala Ser Ser 
210 

Pro Glu Leu Asp 
225 

His Gin Asp lie 



Pro Trp Met lie 
260 

Gly Pro Ser Tyr 
275 

Lys Lys Leu Pro 
290 

Arg Thr Ser Ala 
305 

Trp Thr Pro Leu 



Gin Xaa Arg Ser 
340 

Gin Leu Pro Xaa 
355 

Leu Pro Pro Lys 
370 

Ser Phe lie Xaa 
385 

Glu Ser Ser Leu 



Phe 




Leu Gin Gin Pro 
215 

Trp Met Glu Thr 
230 

Pro Gly Val Gly 
245 

Gin Asp Glu Glu 



Glu Glu Phe Leu 
280 

Pro Asp Arg Val 
295 

Gly Trp Leu Pro 
310 

Ala Val Gin Thr 
325 

Ser His lie Gin 



Gly Xaa Lys Pro 
360 

Phe Phe lie lie 
375 

Phe Asn Lys Val 
390 

Ser lie Pro Ser 
405 



Ser Asn Leu Asp 
220 

Gly Pro Ser Leu 
235 

Asn lie His Ser 
250 

Tyr lie Ala Gly 
265 

Lys Glu Lys Glu 



Gly Ala Asn Phe 
300 

Ser Phe Gly Pro 
315 

Ser Thr Pro Lys 
330 

Lys Lys Ala Asn 
345 

Lys Ser Thr Lys 



Val Phe Leu Arg 
380 

Leu Phe Phe Gly 
395 

Thr Ser Arg Gly 
410 




Leu Pro Pro Ala 



Thr Phe lie Gly 
240 

Gly Ala Thr Pro 
255 

Asn Gin Glu lie 
270 

Lys Gin Lys Leu 
285 

Asp His Ser Ser 



Arg Leu Glu Xaa 
320 

Leu Lys Leu Gin 
335 

His Ala Leu Tyr 
350 

Pro Leu Leu Tyr 
365 

Lys Gin Thr Tyr 



Leu Arg Gly Leu 
400 

Gly Arg Thr His 
415 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGCGGGGGGA CTGTGCCGTG TGGAACGTGT AGCTGTTGAA GGTGGACTCT GTTACCATTG 
AGGATGTTTG GAGGATGAGT ATGTGTGGCA GAGGCACACA TAAACAGGCA GAGACCCTTT 



60 
120 




GCCCCTGCCT TTCTCCCCCA ACCCAAGGCT GACCTGTGTT CTCCCAGGTC TGGGATTCTA 18 0 

AGTGACCTGC TCTGTGTTTG GTCTCTCTCA GGATGAGCAC AAGCCTGGGA GATGGCAGTG 24 0 

ATGGAAATGG CCTGCCCAGG TGCCCCTGGC TCAGCAGTGG GGCAGCAGAA GGAACTCCCC 300 
AAGCCAAAGG AGAAGACGCC GCCACTGGGG AAGAAACAGA GCTCCGTCTA CAAGCTTGAG 3 60 

GCCGTGGAGA AGAGCCCTGT GTTCTGCGGA AAGTGGGAGA TCCTGAATGA CGTGATTACC 4 20 

AAGGGCACAG CCAAGGAAGG CTCCGAGGCA GGGCCAGCTG CCATCTCTAT CATCGCCCAG 4 80 

GCTGAGTGTG AGAATAGCCA AGAGTTCAGC CCCACCTTTT CAGAACGCAT TTTCATCGCT 54 0 

GGGTCCAAAC AGTACAGCCA GTCCGAGAGT CTTGATCAGA TCCCCAACAA TGTGGCCCAT 600 
GCTACAGAGG GCAAAATGGC CCGTGTGTGT TGGAAGGGAA AGCGTCGCAG CAAAGCCCGG 660 

AAGAAAC GG A AGAAGAAGAG CTCAAAGTCC CTGGCTCATG CAGGAGTGGC CTTGGCCAAA 7 20 

CCCCTCCCCA GGACCCCTGA GCAGGAGAGC TGCACCATCC CAGTGCAGGA GGATGAGTCT 7 80 

CCACTCGGCG CCCCATATGT 'TAGAAACACC CCGCAGTTCA CCAAGCCTCT GAAGGAACCA 84 0 
GGCCTTGGGC AACTCTGTTT TAAGCAGCTT GGCGAGGGCC TACGGCCGGC TCTGCCTCGA 900 
TCAGAACTCC ACAAACTGAT CAGCCCCTTG CAATGTCTGA ACCACGTGTG GAAACTGCAC 960 

CACCCCCAGG ACGGAGGCCC CCTGCCCCTG CCCACGCACC CCTTCCCCTA TAGCAGACTG 102 0 

CCTCATCCCT TCCCATTCCA CCCTCTCCAG CCCTGGAAAC CTCACCCTCT GGAGTCCTTC 108 0 

CTGGGCAAAC TGGCCTGTGT AGACAGCCAG AAACCCTTGC CTGACCCACA CCTGAGCAAA 114 0 

CTGGCCTGTG TAGACAGTCC AAAGCCCCTG CCTGGCCCAC ACCTGGAGCC CAGCTGCCTG 1200 

TCTCGTGGTG CCCATGAGAA GTTTTCTGTG GAGGAATACC TAGTGCATGC TCTGCAAGGC 12 60 

AGCGTGAGCT CAAGCCAGGC CCACAGCCTG ACCAGCCTGG CCAAGACCTG GGCAGCACGG 132 0 

GGCTCCAGAT CCCGGGAGCC CAGCCCCAAA ACTGAGGACA ACGAGGGTGT CCTGCTCACT 138 0 

GAGAAACTCA AGCCAGTGGA TTATGAGTAC CGAGAAGAAG TCCACTGGGC CACGCACCAG 14 4 0 

CTCCGCCTGG GCAGAGGCTC CTTCGGAGAG GTGCACAGGA TGGAGGACAA GCAGACTGGC 1500 

TTCCAGTGCG CTGTCAAAAA GGTGCGCCTG GAAGTATTTC GGGCAGAGGA GCTGATGGCA 15 60 

TGTGCAGGAT TGACCTCACC CAGAATTGTC CCTTTGTATG GAGCTGTGAG AGAAGGGCCT 1620 

TGGGTCAACA TCTTCATGGA GCTGCTGGAA GGTGGCTCCC TGGGCCAGCT GGTCAAGGAG 168 0 

CAGGGCTGTC TCCCAGAGGA CCGGGCCCTG TACTACCTGG GCCAGGCCCT GGAGGGTCTG 17 4 0 

GAATACCTCC ACTCACGAAG GATTCTGCAT GGGGACGTCA AAGCTGACAA CGTGCTCCTG 18 00 

TCCAGCGATG GGAGCCACGC AGCCCTCTGT GACTTTGGCC ATGCTGTGTG TCTTCAACCT 18 60 

GATGGCCTGG GAAAGTCCTT GCTCACAGGG GACTACATCC CTGGCACAGA GACCCACATG 1920 



C^j^GCTGC GACGCCAAGG TGGATGTCTG 



GCTCCGGAGG TGGTGCTGGG cSHaGCTGC GACGCCAAGG TGGATGTCTG GTOCAGCTGC 198 0 

TGTATGATGC TGCACATGCT CAACGGCTGC CACCCCTGGA CTCAGTTCTT CCGAGGGCCG 204 0 

CTCTGCCTCA AGATTGCCAG CGAGCCTCCG CCTGTGAGGG AGATCCCACC CTCCTGCGCC 2100 

CCTCTCACAG CCCAGGCCAT CCAAGAGGGG CTGAGGAAAG AGCCCATCCA CCGCGTGTCT 2160 

GCAGCGGAGC TGGGAGGGAA GGTGAACCGG GCACTACAGC AAGTGGGAGG TCTGAAGAGC 2220 

CCTTGGAGGG GAGAATATAA AGAACCAAGA CATCCACCGC CAAATCAAGC CAATTACCAC 228 0 

CAGACCCTCC ATGCCCAGCC GAGAGAGCTT TCGCCAAGGG CCCCAGGGCC CCGGCCAGCT 234 0 

GAGGAGACAA CAGGCAGAGC CCCTAAGCTC CAGCCTCCTC TCCCACCAGA GCCCCCAGAG 2 4 00 

CCAAACAAGT CTCCTCCCTT GACTTTGAGC AAGGAGGAGT CTGGGATGTG GGAACCCTTA 24 60 

CCTCTGTCCT CCCTGGAGCC AGCCCCTGCC AGAAACCCCA GCTCACCAGA GCGGAAAGCA 2520 

ACCGTCCCGG AGCAGGAACT GCAGCAGCTG GAAATAGAAT TATTCCTCAA CAGCCTGTCC 2 580 

CAGCCATTTT CTCTGGAGGA GCAGGAGCAA ATTCTCTCGT GCCTCAGCAT CGACAGCCTC 2 64 0 

TCCCTGTCGG ATGACAGTGA GAAGAACCCA TCAAAGGCCT CTCAAAGCTC GCGGGACACC 27 00 

CTGAGCTCAG. GCGTACACTC CTGGAGCAGC CAGGCCGAGG CTCGAAGCTC CAGCTGGAAC 27 60 

ATGGTGCTGG CCCGGGGGCG GCCCACCGAC ACCCCAAGCT ATTTCAATGG TGTGAAAGTC 282 0 

CAAATACAGT CTCTTAATGG TGAACACCTG CACATCCGGG AGTTCCACCG GGTCAAAGTG 2880 

GGAGACATCG CCACTGGCAT CAGCAGCCAG ATCCCAGCTG CAGCCTTCAG CTTGGTCACC 2 94 0 

AAAGACGGGC AGCCTGTTCG CTACGACATG GAGGTGCCAG ACTCGGGCAT CGACCTGCAG 3000 

TGCACACTGG CCCCTGATGG CAGCTTCGCC TGGAGCTGGA GGGTCAAGCA TGGCCAGCTG 30 60 

GAGAACAGGC CCTAACCCTG CCCTCCACCG CCGGCTCCAC ACTGCCGGAA AGCAGCCTTC 3120 

CTGCTCGGTG CACGATGCTG CCCTGAAAAC ACAGGCTCAG CCGTTCCCAG GGGATTGCCA 3180 

GCCCCCCGGC TCACAGTGGG AACCAGGGCC TCGCAGCAGC AAGGTGGGGG CAAGCAGAAT 32 4 0 

GCCTCCCAGG ATTTCACACC TGAGCCCTGC CCCACCCTGC TGAAAAAACA TCCGCCACGT 3300 

GAAGAGACAG AAGGAGGATG GCAGGAGTTA CCTGGGGAAA CAAAACAGGG ATCTTTTTCT 33 60 

GCCCCTGCTC CAGTCGAGTT GGCCTGACCC GCTTGGATCA GTGACCATTT GTTGGCAGAC 34 20 

AGGGGAGAGC AGCTTCCAGC CTGGGTCAGA AGGGGTGGGC GAGCCCTTCG GCCCCTCACC 34 80 

CTCCAGGCTG CTGTGAGAGT GTCAAGTGTG TAAGGGCCCA AACTCAGGTT CAGTGCAGAA 354 0 

CCAGGTCAGC AGGTATGCCC GCCCGTAGGT TAAGGGGGCC CTCTAAACCC CTTGCCTGGC 3 600 

CTCACCTGGC CAGCTCACCC CTTTTGGGTG TAGGGGAAAA GAATGCCTGA CCCTGGGAAG 3660 

GCTCCCTGGT AGAATACACC ACACTTTTCA GGTTGTTGCA ACACAGGTCC TGAGTTGACC 3720 

TCTGGTTCAG CCAAGGACCA AAGAAGGTGT GTAAGTGAAG TGGTTCTCAG TCCCCAGACA 37 80 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 947 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ala Val Met Glu Met Ala Cys Pro Gly Ala Pro Gly Ser Ala Val 
15 10 15 

Gly Gin Gin Lys Glu Leu Pro Lys Pro Lys Glu Lys Thr Pro Pro Leu 
20 25 30 

Gly Lys Lys Gin Ser Ser Val Tyr Lys Leu Glu Ala Val Glu Lys Ser 
35 40 45 

Pro Val Phe Cys Gly Lys Trp Glu lie Leu Asn Asp Val lie Thr Lys 
50 ' 55 60 

Gly Thr Ala Lys Glu Gly Ser Glu Ala Gly Pro Ala Ala lie Ser lie 
65 70 75 80 

lie Ala Gin Ala Glu Cys Glu Asn Ser Gin Glu Phe Ser Pro Thr Phe 
85 90 95 



J or 



# 




Ser Glu Arg lie Phe lie Ala Gly Ser Lys Gin Tyr Ser Gin Ser Glu 
100 105 110 

Ser Leu Asp Gin lie Pro Asn Asn Val Ala His Ala Thr Glu Gly Lys 
115 120 125 

Met Ala Arg Val Cys Trp Lys Gly Lys Arg Arg Ser Lys Ala Arg Lys 
130 135 140 

Lys Arg Lys Lys Lys Ser Ser Lys Ser Leu Ala His Ala Gly Val Ala 
145 150 155 160 

Leu Ala Lys Pro Leu Pro Arg Thr Pro Glu Gin Glu Ser Cys Thr He 
165 170 175 

Pro Val Gin Glu Asp Glu Ser Pro Leu Gly Ala Pro Tyr Val Arg Asn 
180 185 190 

Thr Pro Gin Phe Thr Lys Pro Leu Lys Glu Pro Gly Leu Gly Gin Leu 
195 200 205 

Cys Phe Lys Gin Leu Gly Glu Gly Leu Arg Pro Ala Leu Pro Arg Ser 
210 215 220 

Glu Leu His Lys Leu He Ser Pro Leu Gin Cys Leu Asn His Val Trp 
225 230 235 240 

Lys Leu His His Pro Gin Asp Gly Gly Pro Leu Pro Leu Pro Thr His 
245 250 255 

Pro Phe Pro Tyr Ser Arg Leu Pro His Pro Phe Pro Phe His Pro Leu 
260 265 270 

Gin Pro Trp Lys Pro His Pro Leu Glu Ser Phe Leu Gly Lys Leu Ala 
275 " 280 285 

Cys Val Asp Ser Gin Lys Pro Leu Pro Asp Pro His Leu Ser Lys Leu 
290 295 300 

Ala Cys Val Asp Ser Pro Lys Pro Leu Pro Gly Pro His Leu Glu Pro 
305 310 315 320 

Ser Cys Leu Ser Arg Gly Ala His Glu Lys Phe Ser Val Glu Glu Tyr 
325 330 335 

Leu Val His Ala Leu Gin- Gly Ser Val Ser Ser Ser Gin Ala His Ser 
340 345 350 

Leu Thr Ser Leu Ala Lys Thr Trp Ala Ala Arg Gly Ser Arg Ser Arg 
355 360 365 

Glu Pro Ser Pro Lys Thr Glu Asp Asn Glu Gly Val Leu Leu Thr Glu 
370 375 380 

Lys Leu Lys Pro Val Asp Tyr Glu Tyr Arg Glu Glu Val His Trp Ala 
385 390 395 400 

Thr His Gin Leu Arg Leu Gly Arg Gly Ser Phe Gly Glu Val His Arg 
405 410 415 




Giy rRhe Gin Cys Ala Val Lys L,ys Val Arg 
^ 425 430 




Leu Glu Val Phe Arg Ala Glu Glu Leu Met Ala Cys Ala Gly Leu Thr 
435 440 445 

Ser Pro Arg He Val Pro Leu Tyr Gly Ala Val Arg Glu Gly Pro Trp 
450 455 460 

Val Asn He Phe Met Glu Leu Leu Glu Gly Gly Ser Leu Gly Gin Leu 
465 470 475 480 

Val Lys Glu Gin Gly Cys Leu Pro Glu Asp Arg Ala Leu Tyr Tyr Leu 
485 490 495 

Gly Gin Ala Leu Glu Gly Leu Glu Tyr Leu His Ser Arg Arg He Leu 
500 505 510 

His Gly Asp Val Lys Ala Asp Asn Val Leu Leu Ser Ser Asp Gly Ser 
515 520 525 

His Ala Ala Leu Cys Asp Phe Gly His Ala Val Cys Leu Gin Pro Asp 
530 535 540 

Gly Leu Gly Lys Ser Leu Leu Thr Gly Asp Tyr He Pro Gly Thr Glu 
545 550 555 560 

Thr His Met Ala Pro Glu Val Val Leu Gly Arg Ser Cys Asp Ala Lys 
565 570 575 

Val Asp Val Trp Ser Ser Cys Cys Met Met Leu His Met Leu Asn Gly 
580 585 590 

Cys His Pro Trp Thr Gin Phe Phe Arg Gly Pro Leu Cys Leu Lys He 
595 600 605 

Ala Ser Glu Pro Pro Pro Val Arg Glu He Pro Pro Ser Cys Ala Pro 
610 615 620 

Leu Thr Ala Gin Ala He Gin Glu Gly Leu Arg Lys Glu Pro He His 
625 630 635 640 

Arg Val Ser Ala Ala Glu Leu Gly Gly Lys Val Asn Arg Ala Leu Gin 
645 650 655 

Gin Val Gly Gly Leu Lys Ser Pro Trp Arg Gly Glu Tyr Lys Glu Pro 
660 665 670 

Arg His Pre Pro Pro Asn Gin Ala Asn Tyr His Gin Thr Leu His Ala 
675 680 685 

Gin Pro Arg Glu Leu Ser Pro Arg Ala Pro Gly Pro Arg Pro Ala Glu 
690 695 700 

Glu Thr Thr Gly Arg Ala Pro Lys Leu Gin Pro Pro Leu Pro Pro Glu 
705 710 715 720 

Pro Pro Glu Pro Asn Lys Ser Pro Pro Leu Thr Leu Ser Lys Glu Glu 
725 730 735 

Ser Gly Met Trp Glu Pro Leu Pro Leu Ser Ser Leu Glu Pro Ala Pro 



4 * 

740 745 ^o0 

Ala Arg Asn Pro Ser Ser Pro Glu Arg Lys Ala Thr Val Pro Glu Gin 
755 760 765 

Glu Leu Gin Gin Leu Glu lie Glu Leu Phe Leu Asn Ser Leu Ser Gin 
770 775 780 

Pro Phe Ser Leu Glu Glu Gin Glu Gin lie Leu Ser Cys Leu Ser lie 
785 790 795 800 

Asp Ser Leu Ser Leu Ser Asp Asp Ser Glu Lys Asn Pro Ser Lys Ala 
805 810 815 

Ser Gin Ser Ser Arg Asp Thr Leu Ser Ser Gly Val His Ser Trp Ser 
820 825 830 

Ser Gin Ala Glu Ala Arg Ser Ser Ser Trp Asn Met Val Leu Ala Arg 
835 840 845 

Gly Arg Pro Thr Asp Thr Pro Ser Tyr Phe Asn Gly Val Lys Val Gin 
850 855 860 

He Gin Ser Leu Asn Gly Glu His Leu His He Arg Glu Phe His Arg 
865 870 875 880 

Val Lys Val Gly Asp He Ala Thr Gly He Ser Ser Gin He Pro Ala 
885 890 895 

Ala Ala Phe Ser Leu Val Thr Lys Asp Gly Gin Pro Val Arg Tyr Asp 
900 905 910 

Met Glu Val Pro Asp' Ser Gly He Asp Leu Gin Cys Thr Leu Ala Pro 
915 920 925 

Asp Gly Ser Phe Ala Trp Ser Trp Arg Val Lys His Gly Gin Leu Glu 
930 935 940 

Asn Arg Pro 
945 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide PCR 

primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAGGATCCTC ATGGCTGCAG CTAGCGTGAC 
(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs f 4 

(B) TYPE: nucleic acid 7 ■ 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear if 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide PCR 



primer' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGTCGACTTA GAGCCCTGTC AGGTCCACAA TG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide probe 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GATGCCATTG GGGATTTCCT CTTT 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide probe 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAGTAAAGAG GAAATCCCCA ATGG 



